# Quantized Inference

Llama 3.2 1B Instruct GGUF

Llama-3.2-1B-Instruct is a 1B-parameter instruction-tuned model based on the Llama architecture. It is offered in multiple quantization formats to suit different hardware requirements.

Large Language Model, Supports Multiple Languages

Mxbai Rerank Large V2 GGUF

mxbai-rerank-large-v2 is a multilingual text reranking model, available in various quantization formats to suit different hardware environments.

Text Embedding, Supports Multiple Languages

Gemmax2 28 2B 4bit

This is a collection of GGUF quantized versions of GemmaX2-28-2B-v0.1, a translation large language model developed by Xiaomi that supports machine translation across 28 languages.

Machine Translation, Transformers, Supports Multiple Languages

WhisperKit Pro is the commercial version of WhisperKit. It focuses on automatic speech recognition (ASR) and supports quantization for efficient speech processing.

Speech Recognition

© 2025 AIbase